Optimal Sequential Detection in Multi-stream Data
نویسندگان
چکیده
Consider a large number of detectors each generating a data stream. The task is to detect online, distribution changes in a small fraction of the data streams. Previous approaches to this problem include the use of mixture likelihood ratios and sum of CUSUMs. We provide here extensions and modifications of these approaches that are optimal in detecting normal mean shifts. We show how the (optimal) detection delay depends on the fraction of data streams undergoing distribution changes as the number of detectors goes to infinity. There are three detection domains. In the first domain for moderately large fractions, immediate detection is possible. In the second domain for smaller fractions, the detection delay grows logarithmically with the number of detectors, with an asymptotic constant extending those in sparse normal mixture detection. In the third domain for even smaller fractions, the detection delay lies in the framework of the classical detection delay formula of Lorden. We show that the optimal detection delay is achieved by the sum of detectability score transformations of either the partial scores or CUSUM scores of the data streams.
منابع مشابه
Optimal overhaul–replacement policy for a multi-degraded repairable system sold with warranty
In this research, we study an optimal overhaul–replacement policy of a multi-degraded repairable system sold with a free replacement warranty. In the proposed replacement policy, a maintenance action and failure are dependent on a system degradation level and the system age, and hence the replacement model will provide more effective maintenance decisions. Failure of the system is modeled using...
متن کاملSequential Detection with Limited Memory
Sequential tests outperform fixed sample size tests by requiring fewer samples on average to achieve the same level of error performance. The Sequential Probability Ratio Test (SPRT) has been suggested by Wald [1] for sequential binary hypothesis testing problems. SPRT recursively calculates the likelihood of an observed data stream and requires this likelihood to be stored in memory between sa...
متن کاملExtraction de motifs séquentiels dans les flux de données. (Sequential patterns mining from data streams)
In recent years, many applications dealing with data generated continuously and at high speeds have emerged. These data are now quali ed as data streams. Dealing with potentially in nite quantities of data imposes constraints that raise many processing problems. As an example of such constraints we have the inability to block the data stream as well as the need to produce results in real time. ...
متن کاملChange detection from satellite images based on optimal asymmetric thresholding the difference image
As a process to detect changes in land cover by using multi-temporal satellite images, change detection is one of the practical subjects in field of remote sensing. Any progress on this issue increase the accuracy of results as well as facilitating and accelerating the analysis of multi-temporal data and reducing the cost of producing geospatial information. In this study, an unsupervised chang...
متن کاملRanking Sequential Patterns with Respect to Significance
We present a reliable universal method for ranking sequential patterns (itemset-sequences) with respect to significance in the problem of frequent sequential pattern mining. We approach the problem by first building a probabilistic reference model for the collection of itemsetsequences and then deriving an analytical formula for the frequency for sequential patterns in the reference model. We r...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016